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Abstract 

This paper proposes decoupling the dpppndpncy 



t-rpp from wnrrl nrr\pr snrh that snrfarp nrHpr- 

ing is not determined by traversing the depen- 
dency tree. We develop the notion of a word 
order domain structure, which is linked but 
structurally dissimilar to the syntactic depen- 
dency tree. The proposal results in a lexicalized, 
declarative, and formally precise description of 
word order; features which lack previous pro- 
posals for dependency grammars. Contrary to 
other lexicalized approaches to word order, our 
proposal does not require lexical ambiguities for 
ordering alternatives. 

1 Introduction 

Recently, the concept of valency has gained con- 
siderable attention. Not only do all linguistic 
theories refer to some reformulation of the tra- 
ditional notion of valency (in the form of 8- 
grid, subcategorization list, argument list, or 
extended domain of locality); there is a grow- 
ing number of parsers based on binary relations 
between words ( [Eisner, 1997 ; Maruyama, 1990 ). 

Given this interest in the valency concept, 
and the fact that word order is one of the 
main difference between phrase-structure based 
approaches (henceforth PSG) and dependency 
grammar (DG), it is valid to ask whether DG 
can capture word order phenomena without re- 
course to phrasal nodes, traces, slashed cate- 
gories, etc. A very early result on the weak 
generative equivalence of context-free grammars 
and DGs suggested that DGs are incapable of 
describing surface word order ( Gaifman, 1965 ). 
This result has recently been critizised to ap- 
ply only to impoverished DGs which do not 
properly represent formally the expressivity of 



Our position will be that dependency re- 
lations are motivated semantically ([Tesniere, 



1959 ), and need not be projective (i.e., may 



cross if projected onto the surface ordering). We 
argue for so-called word order domains, consist- 
ing of partially ordered sets of words and associ- 
ated with nodes in the dependency tree. These 
order domains constitute a tree defined by set 
inclusion, and surface word order is determined 
by traversing this tree. A syntactic analysis 
therefor consists of two linked, but dissimilar 
trees. 

bee. | will briefly review approaches to word 
order in DG. In Sec. 3, word order domains will 
be defined, and Sec. I] introduces a modal logic 
to describe dependency structures. Sec. [| ap- 
plies our approach to the German clause and 
Sec. [6| relates it to some PSG approaches. 

2 Word Order in DG 

A very brief characterization of DG is that 
it recognizes only lexical, not phrasal nodes, 
which are linked by directed, typed, binary rela- 
tions to form a dependency tree ( Tesniere, 1959| ; 
Hudson, 1991) . The following overview of DG 
flavors shows that various mechanisms (global 
rules, general graphs, procedural means) are 
generally employed to lift the limitation of pro- 
jectivity and discusses some shortcomings of 
these proposals. 

Functional Generative Description (Sgall 



|et al., 1986| ) assumes a language-independent 
underlying order, which is represented as a pro- 
jective dependency tree. This abstract repre- 
sentation of the sentence is mapped via ordering 
rules to the concrete surface realization. Re- 



contemporary DG variants ( Neuhaus Broker 
1997b . 



cently, [Kruijff (1997[) has given a categorial- 
style formulation of these ordering rules. He 
assumes associative categorial operators, per- 
muting the arguments to yield the surface or- 



dering. One difference to our proposal is that 
we argue for a representational account of word 
order (based on valid structures representing 
word order) , eschewing the non-determinism in- 
troduced by unary operators; the second differ- 
ence is the avoidance of an underlying structure, 
which stratifies the theory and makes incremen- 
tal processing difficult. 



Meaning- Text Theory flMelc'uk, 1988|) as- 
sumes seven strata of representation. The 
rules mapping from the unordered depen- 
dency trees of surface-syntactic representations 
onto the annotated lexeme sequences of deep- 
morphological representations include global or- 
dering rules which allow discontinuities. These 
rules have not yet been formally specified 
flMelc'uk fc Pertsov, 198gp .l87f). 

Word Grammar (WG, [Hudson (1990|) ) is 

based on general graphs instead of trees. 
The ordering of two linked words is specified 
together with their dependency relation, as 
in the proposition "object of verb follows 
it". Extraction of, e.g., objects is analyzed 
by establishing an additional dependency called 
visitor between the verb and the extractee, 
which requires the reverse order, as in "visitor 
of verb precedes it". This results in incon- 
sistencies, since an extracted object must follow 
the verb (being its object) and at the same 
time precede it (being its visitor). The ap- 
proach compromises the semantic motivation 
of dependencies by adding purely order-induced 
dependencies. WG is similar to our proposal in 
that it also distinguishes a propositional meta 
language describing the graph-based analysis 
structures. 

Dependency Unification Grammar 

(DUG, Hellwig (1986| )) defines a tree- like data 
structure for the representation of syntactic 
analyses. Using morphosyntactic features with 
special interpretations, a word defines abstract 
positions into which modifiers are mapped. 
Partial orderings and even discontinuities can 
thus be described by allowing a modifier to 
occupy a position defined by some transitive 
head. The approach requires that the parser 
interpretes several features specially, and it 
cannot restrict the scope of discontinuities. 

Slot Grammar ( |McCord, 199C| ) employs a 
number of rule types, some of which are ex- 



clusively concerned with precedence. So-called 
head/slot and slot/slot ordering rules describe 
the precedence in projective trees, referring to 
arbitrary predicates over head and modifiers. 
Extractions (i.e., discontinuities) are merely 
handled by a mechanism built into the parser. 

3 Word Order Domains 

Summarizing the previous discussion, we re- 
quire the following of a word order description 
for DG: 

• not to compromise the semantic motivation 
of dependencies, 

• to be able to restrict discontinuities to cer- 
tain constructions and delimit their scope, 

• to be lexicalized without requiring lexical 
ambiguities for the representation of order- 
ing alternatives, 

• to be declarative (i.e., independent of an 
analysis procedure), and 

• to be formally precise and consistent. 

The subsequent definition of an order domain 
structure and its linking to the dependency tree 
satisify these requirements. 

3.1 The Order Domain Structure 

A word order domain is a set of words, general- 
izing the notion of positions in DUG. The car- 
dinality of an order domain may be restricted 
to at most one element, at least one element, 
or - by conjunction - to exactly one element. 
Each word is associated with a sequence of order 
domains, one of which must contain the word 
itself, and each of these domains may require 
that its elements have certain features. Order 
domains can be partially ordered based on set 
inclusion: If an order domain d contains word 
w (which is not associated with d), every word 
w' contained in a domain d' associated with w 
is also contained in d; therefor, d' C d for each 
d' associated with w. This partial ordering in- 
duces a tree on order domains, which we call 
the order domain structure. 

Take the example of German ll Den Mann hat 
der Junge gesehen" ("the manAcc ~ has - the 
boy no M " seen'''). Its dependency tree is shown 
in Figjl], with word order domains indicated 



'hat d 2 ~ ~ ~ " ~ - - - 

der Junge; ; gesehen/'! ! 



ticiple to follow subject and object, resp.J 
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Figure 1: Dependency Tree and Order Domains 
for "Den Mann hat der Junge gesehen" 
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Figure 2: Order Domain Structure for "Den 
Mann hat der Junge gesehen" 

by dashed circles. The finite verb, "hat", de- 
fines a sequence of domains, ((^1,^2,^3), which 
roughly correspond to the topological fields in 
the German main clause. The nouns "Mann" 
and "Junge" and the participle "gesehen" each 
define one order domain (d^cfe,^, resp.). Set 
inclusion gives rise to the domain structure in 
Figjj2|, where the individual words are attached 
by dashed lines to their including domains {d\ 
and collapse, being identical) .[] 

3.2 Surface Ordering 

How is the surface order derived from an or- 
der domain structure? First of all, the ordering 
of domains is inherited by their respective ele- 
ments, i.e., "Mann" precedes (any element of) 
d%, "hat" follows (any element of) d±, etc. 

Ordering within a domain, e.g., of "hat" and 
d§, or ds and d%, is based on precedence pred- 
icates (adapting the precedence predicates of 
WG). There are two different types, one order- 
ing a word w.r.t. any other element of the do- 
main it is associated with (e.g., "hat" w.r.t. d§), 
and another ordering two modifiers, referring to 
the dependency relations they occupy (cfe and 
de, referring to subj and vpart). A verb like 
"hat" introduces two precedence predicates, re- 
quiring other words to follow itself and the par- 



Note that in this case, we have not a single rooted 
tree, but rather an ordered sequence of trees (by virtue 
of ordering d\,d,2, and ^3) as domain structure. In gen- 
eral, we assume the sentence period to govern the finite 
verb and to introduce a single domain for the complete 
sentence. 



"hat" 



(<* A (vpart) > {S ubj,obj}) 



Informally, the first conjunct is satisfied by 
any domain in which no word precedes "hat", 
and the second conjunct is satisfied by any do- 
main in which no subject or object follows a par- 
ticiple. The domain structure in Fig.§ satisfies 
these restrictions since nothing follows the par- 
ticiple, and because "den Mann" is not an ele- 
ment of d%, which contains "hat" . This is an im- 
portant interaction of order domains and prece- 
dence predicates: Order domains define scopes 
for precedence predicates. In this way, we take 
into account that dependency trees are flatter 
than PS-based ones[] and avoid the formal in- 
consistencies noted above for WG. 

3.3 Linking Domain Structure and 
Dependency Tree 

Order domains easily extend to discontinuous 
dependencies. Consider the non-projective tree 
in Fig.[l]. Assuming that the finite verb gov- 
erns the participle, no projective dependency 
between the object "den Mann" and the par- 
ticiple "gesehen" can be established. We al- 
low non-projectivity by loosening the linking be- 
tween dependency tree and domain structure: 
A modifier (e.g., "Mann") may not only be in- 
serted into a domain associated with its direct 
head {"gesehen"), but also into a domain of a 
transitive head {"hat"), which we will call the 
positional head. 

The possibility of inserting a word into a do- 
main of some transitive head raises the ques- 
tions of how to require contiguity (as needed 
in most cases), and how to limit the distance 
between the governor and the modifier in the 
case of discontinuity. From a descriptive view- 
point, the syntactic construction is often cited 
to determine the possibility and scope of discon- 
tinuities flBhatt, 199C| ; [Matthews, 19811 ). In PS- 
based accounts, the construction is represented 
by phrasal categories, and extraction is lim- 
ited by bounding nodes (e.g., Haegeman (1994] ) , 
Becker et al. (1991 )). In dependency-based ac- 
counts, the construction is represented by the 



2 For details of the notation, please refer to Sec. 

3 Note that each phrasal level in PS-based trees de- 
fines a scope for linear precedence rules, which only apply 
to sister nodes. 



dependency relation, which is typed or labelled 
to indicate constructional distinctions which are 
configurationally defined in PSG. Given this 
correspondence, it is natural to employ depen- 
dencies in the description of discontinuities as 
follows: For each modifier of a certain head, a 
set of dependency types is defined which may 
link the direct head and the positional head of 
the modifier ("gesehen" and "hat", resp.). If 
this set is empty, both heads are identical and a 
contiguous attachment results. The impossibil- 
ity of extraction from, e.g., a finite verb phrase 
may follow from the fact that the dependency 
embedding finite verbs, propo, may not appear 
on any path between a direct and a positional 
headfj 

4 The Description Language 

This section sketches a logical language describ- 
ing the dependency structure. It is based on 
modal logic and owes much to work of |Black-| 
burn (1994 ). As he argues, standard Kripke 



models can be regarded as directed graphs with 
node annotations. We will use this interpreta- 
tion to represent dependency structures. De- 
pendencies and the mapping from dependency 
tree to order domain structure are described by 
modal operators, while simple properties such 
as word class, features, and cardinality of order 
domains are described by modal propositions. 

4.1 Model Structures 

In the following, we assume a set of words, W, 
ordered by a precedence relation, -<, a set of 
dependency types, V, a set of atomic feature 
values A, and a set of word classes, C. We define 
a family of dependency relations Rd C W X 
W, d £ T> and for convenience abbreviate the 
union {Jdev R d as Rv- 

Def: A dependency tree is a tuple 

(W, w r , Rx>, V_a, Vc), where Rx> forms a tree 
over W rooted in w r , Vj\_ : W i— > 2^ maps words 
to sets of features, and Vc : W i— > C maps words 
to word classes. 



4 One review pointed out that some verbs may allow 
extractions, i.e., that this restriction is lexical, not uni- 
versal. This fact can easily be accomodated because the 
possibility of discontinuity (and the dependency types 
across which the modifier may be extracted) is described 
in the lexical entry of the verb. In fact, a universal re- 
striction could not even be stated because the treatment 
is completely lexicalized. 



Def: An order domain (over W) m is a set of 
words from W where \/w\,W2,wz € W : (w± -< 
W2 ~< W3 A u>i £ m A W3 G m) W2 € m. 
Def: An order domain structure (over W) M 
is a set of order domains where Vm, m' € M. : 
m fl m' = V m C m' V w! C m. 
Def: A dependency structure T is a 
tuple (W, w r , Rt>, Va, Vc, M., Vm) where 
(W,w r ,Rx>,V_A,Vc) is a dependency tree, M. 
is an order domain structure over W, and 
Vm : W 1 — ^ A4 n maps words to order domain 
sequences. 

Additionally, we require for a dependency 
structure four more conditions: (1) Each word 
w is contained in exactly one of the domains 
from Vm(w), (2) all domains in Vm(u>) are pair- 
wise disjoint, (3) each word (except w r ) is con- 
tained in at least two domains, one of which 
is associated with a (transitive) head, and (4) 
the (partial) ordering of domains (as described 
by Vm) is consistent with the precedenc e of the 
words contained in the domains (see ( |Broker 



1997) for more details). 



4.2 The Language £73 

FigJH defines the logical language Cx> used to 
describe dependency structures. Although they 
have been presented differently, they can eas- 
ily be rewritten as (multimodal) Kripke mod- 
els: The dependency relation R^ is represented 
as modality (d) and the mapping from a word to 
its ith order domain as modality o % M .[] All other 
formulae denote properties of nodes, and can be 
formulated as unary predicates - most evident 
for word class and feature assignment. For the 
precedence predicates <* and <<5, there are in- 
verses >* and ><j. For presentation, the relation 
places C W x W has been introduced, which 
holds between two words iff the first argument 
is the positional head of the second argument. 

A more elaborate definition of dependency 
structures and Cx> defines two more dimensions, 
a feature graph mapped off the dependency tree 
much like the proposal of [Blackburn (1994 ), and 
a conceptual representation based on termino- 
logical logic, linking content words with refer- 
ence objects and dependencies with conceptual 
roles. 



5 The modality D % M can be viewed as an abbreviation 
of <>' M D M , composed of a mapping from a word to its ith 
order domain and from that domain to all its elements. 



Syntax (valid formulae) 



h <* 



Semantics (satisfaction relation) 



c = VcH 

^3ro'eW: AT,w' \= cj) 

<^ 3m £ M : (Vm(w) = (■ ■ -m- ■ ■) 
AVu/ G m : (w = w' V w -< z//)) 
->Ekt/, u>", w'" 6 W : places(w', w) 
Aplaces(w', w") A w"'Rsw A «/" -< w 
:<^> 3«/, w" G W : wR v wA 



c G £p, Vc G C 
a G £x>, Va G .4 

(d}0 G £d, VdeV^eCv 
<* G £x>, 

< 5 G £x>, V<5 C 2? 

Ta G C v , Vo C X> 



single eC v ,yie IN 
o^filled G £p, Vi £ N 

cf> ai/j e Ct> , y<f>, ip G £x> 

-|0 G £d, V0 G £d 



T, w 
T, w; 
T, i« 
T, it) 

T, w 

T, w 



places(w" , w) A w" R* s w'o 

w' g $ ; (KmH)a 

-<3w" : (w"R v w'A\ < 1 
w" G $ ( (KmH)) 
|= oj^filled ^(V>(M)|>1 

|= A ^ T,w^andT,wH 
|= -10 :4=> not T,w\=ip 



T, to 
T, w 
T, w 
T, i/; 



Figure 3: Syntax and Semantics of Cx> Formulae 



Vf in ^o^,(single A filled) A D^initial [1] 

A □^(middle A norel) ' ' [2] 

A single A (final A norel) [3] 

AV2 & (middleA <* AD^norel) [4] 

A VEnd ^ (middleA >*) ' [5] 

A VI <=> (initial A norel) [6] 

Figure 4: Domain Description of finite verbs 



"hat" AVfin [7] 

A (subj) ( "Junge" A T0) [8] 

A(vpart) ( "gesehen" A f [9] 

A ->f inal A > {S ubj,obj} [ 10 ] 

A (obj) ("Mann" A t{vpart})) [H] 

Figure 5: Hierachical Structure 



5 The German Clause 

Traditionally, the German main clause is de- 
scribed using three topological fields; the ini- 
tial and middle fields are separated by the fi- 
nite (auxiliary) verb, and the middle and the 
final fields by infinite verb parts such as sepa- 
rable prefixes or participles. We will generalize 
this field structure to verb-initial and verb-final 
clauses as well, without going into the linguistic 
motivation due to space limits. 

The formula in Fig.|| states that all finite 
verbs (word class Vf in £ C) define three order 
domains, of which the first requires exactly one 
element with the feature initial [1], the second 
allows an unspecified number of elements with 
features middle and norel [2], and the third al- 
lows at most one element with features final 
and norel [3]. The features initial, middle, 
and final £ A serve to restrict placement of 
certain phrases in specific fields; e.g., no reflex- 
ive pronouns can appear in the final field. The 
norel £ A feature controls placement of a rela- 
tive NP or PP, which may appear in the initial 
field only in verb-final clauses. The order types 



are defined as follows: In a verb-second clause 
(feature V2), the verb is placed at the begin- 
ning (<*) of the middle field (middle), and the 
element of the initial field cannot be a relative 
phrase (o^norel in [4]). In a verb-final clause 
(VEnd), the verb is placed at the end (>*) of the 
middle field, with no restrictions for the initial 
field (relative clauses and non-relative verb-final 
clauses are subordinated to the noun and con- 
junction, resp.) [5]. In a verb-initial clause (VI), 
the verb occupies the initial field [6]. 

The formula in Fig.|| encodes the hierarchical 
structure from Fig.[l] and contains lexical restric- 
tions on placement and extraction (the surface 
is used to identify the word). Given this, the 
order type of "hat" is determined as follows: 
The participle may not be extraposed (-if inal 
in [10]; a restriction from the lexical entry of 
"hat"), it must follow "hat" in d%. Thus, the 
verb cannot be of order type VEnd, which would 
require it to be the last element in its domain 
(>* in [5]). "Mann" is not adjacent to "gese- 
hen" , but may be extracted across the depen- 
dency vpart (Tjvpart} i n [1 allowing its in- 
sertion into a domain defined by "hat" . It can- 



not precede "hat" in d?, because "hat" must ei- 
ther begin d% (due to <* in [4]) or itself go into 
d\. But d% allows only one phrase (single), 
leaving only the domain structure from Figj2], 
and thus the order type V2 for "hat" . 

6 Comparison to PSG Approaches 

One feature of word order domains is that they 
factor ordering alternatives from the syntactic 
tree, much like feature annotations do for mor- 
phological alternatives. Other lexicalized gram- 
mars collapse syntactic and ordering informa- 
tion and are forced to represent ordering alter- 
natives by lexical ambiguity, most notable L- 
TAG ( Schabes et al,, 1988| ) and some versions 
of CG QHepple, 1994j ). This is not necessary 
in our approach, which drastically reduces the 
search space for parsing. 

This property is shared by the proposal of 
Rcapc (1993 ) to associate HPSG signs with se- 
quences of constituents, also called word or- 
der domains. Surface ordering is determined 
by the sequence of constituents associated with 
the root node. The order domain of a mother 
node is the sequence union of the order domains 
of the daughter nodes, which means that the 
relative order of elements in an order domain 
is retained, but material from several domains 
may be interleaved, resulting in discontinuities. 
Whether an order domain allows interleaving 
with other domains is a parameter of the con- 
stituent. This approach is very similar to ours 
in that order domains separate word order from 
the syntactic tree, but there is one important 
difference: Word order domains in HPSG do 
not completely free the hierarchical structure 
from ordering considerations, because disconti- 
nuity is specified per phrase, not per modifier. 
For example, two projections are required for 
an NP, the lower one for the continuous mate- 
rial (determiner, adjective, noun, genitival and 
prepositional attributes) and the higher one for 
the possibly discontinuous relative clause. This 
dependence of hierarchical structure on order- 
ing is absent from our proposal. 

We may also compare our approach with the 



projection architecture of LFG (Kaplan & Bres- 
nan, 1982| ; [Kaplan, 19951 ) . There is a close sim- 



tively). C-structure and order domains repre- 
sent surface ordering, whereas f-structure and 
dependency tree show the subcategorization or 
valence requirements. What is more, these pro- 
jections or dimensions are linked in both ac- 
counts by an element-wise mapping. The dif- 
ference between the two architectures lies in the 
linkage of the projections or dimensions: LFG 
maps f-structure off c-structure. In contrast, 
the dependency relation is taken to be primi- 
tive here, and ordering restrictions are taken to 
be indicators or consequences of dependency re- 
lations (see also Prokerj ( |1998b| , [19M^ )). 



7 Conclusion 

We have presented an approach to word or- 
der for DG which combines traditional notions 
(semantically motivated dependencies, topolog- 
ical fields) with contemporary techniques (log- 
ical description language, model-theoretic se- 
mantics). Word order domains are sets of par- 
tially ordered words associated with words. A 
word is contained in an order domain of its 
head, or may float into an order domain of a 
transitive head, resulting in a discontinuous de- 
pendency tree while retaining a projective order 
domain structure. Restrictions on the floating 
are expressed in a lexicalized fashion in terms 
of dependency relations. An important benefit 
is that the proposal is lexicalized without re- 
verting to lexical ambiguity to represent order 
variation, thus profiting even more from the effi- 
ciency considerations discussed by [Schabes et al. 
(T988|). 



It is not yet clear what the generative capac- 
ity of such lexicalized discontinuous DGs is, but 
at least some index languages (such as a n b n c n ) 
can be characterized. |Neuhaus fc Broker (1997^ ) 



have shown that recognition and parsing of such 
grammars is A^P-complete. A parser operating 
on the model structures is described in (Hahn 
let al., 1997|) . 



ilarity of the LFG projections (c-structure and 
f-structure) to the dimensions used here (order 
domain structure and dependency tree, respec- 
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